ON SOME MEAN MATRIX INEQUALITIES OF DYNAMICAL INTEREST

IGOR RIVIN 

Abstract. Let $A \in SL(n, \mathbb{R})$. We show that for all $n > 2$ there exist
dimensional strictly positive constants $C_n$ such that

\[ \int_{O_n} \log \rho(AX)\,dX \ge C_n \log \|A\|, \]

where $\|A\|$ denotes the operator norm of $A$ (which equals the
largest singular value of $A$), $\rho$ denotes the spectral radius, and
the integral is with respect to the Haar measure on $O_n$. The same
result (with essentially the same proof) holds for the unitary group
$U_n$ in place of the orthogonal group. The result does not hold in
dimension 2. This answers questions asked in [3, 5, 4]. We also discuss
what happens when the integral above is taken with respect
to a measure other than the Haar measure.
We also give a simple proof that

\[ \int_{S^{n-1}} \log \|Au\|\,d\sigma_n \ge 0, \]

with equality if and only if $A^t A = I_n$ ($d\sigma_n$ is the standard measure
on $S^{n-1}$).



Introduction 

A. Wilkinson brought the following questions to the author's attention:

Question 1. Let $A \in SL(n, \mathbb{R})$. Consider the coset $\mathfrak{A} = AO(n)$. Then is
it true that

\[ \int_{\mathfrak{A}} do \int_{S^{n-1}} \log \|Bu\|\,d\sigma_n > 0 \]

whenever $A$ is not orthogonal? (In the above integral, $do$ denotes the
Haar measure on the orthogonal group, normalized to be a probability
measure; $d\sigma_n$ is the standard measure on $S^{n-1}$.)

A version of this question is addressed in the paper [5]. In this
note we give a very simple proof that the answer is in the affirmative
(Theorem 3).

Question 2. Let $A \in SL(n, \mathbb{R})$; then

\[ \int_{O_n} \log \rho(AX)\,dX \ge C_n \int_{\mathfrak{A}} do \int_{S^{n-1}} \log \|Bu\|\,d\sigma_n \ge 0, \]

for some positive dimensional constant $C_n$.

The above is Question 6.6 in 0. The question is answered in 
dimension 2 in the paper [1]; in fact, the authors show that in that
case one can take $C_n = 1$, and replace $\ge$ by $=$.

In the papers [3, 5] the question is also posed as to what happens
when the measure on the orthogonal group is not the Haar measure
(actually, the question is asked in a slightly different context).
Obviously the result is unchanged if a measure $\mu$ is in the same measure
class as the Haar measure, but we note that in two dimensions the
sign of the inequality in Question 2 can be reversed whenever $\mu$ is
supported on a proper subset of $O(2) = S^1$; see Section 6 for details.

The same question with the orthogonal group replaced by the
unitary group is answered in the paper [4] (in arbitrary dimension).
The dimensional constant is again equal to 1, though the inequality
cannot be replaced by an equality. In [4], Dedieu and Shub also
conjecture a positive answer to Question 2. The same conjecture is
made in [5].

In this note we give a simple proof of 

Theorem 3. Let $A \in SL(n, \mathbb{R})$. Then

\[ \int_{S^{n-1}} \log \|Au\|\,d\sigma_n \ge 0, \]

with equality if and only if $A^t A = I_n$ (and $d\sigma_n$ denotes the standard measure
on the unit sphere).

and, as one of the consequences, we answer Question 1 in the
affirmative.
We also show 
Theorem 4. For all $n > 2$ there exist strictly positive constants $C_n$ such
that

\[ \int_{O_n} \log \rho(AX)\,dX \ge C_n \log \|A\|, \]

where $\|A\|$ denotes the operator norm of $A$ (which equals the largest singular
value of $A$), $\rho$ denotes the spectral radius, and the integral is with respect to
the Haar measure on $O_n$. The same result (with essentially the same proof)
holds for the unitary group $U_n$ in place of the orthogonal group.

This result easily implies a positive answer to Question 2 in
dimensions bigger than 2. Theorem 4 is false in dimension 2. Some
discussion of why dimension 2 is different is given in Section 6.

It should also be noted that Theorem 4 immediately implies the
following

Corollary 5. Let $\mu$ be an orthogonally invariant measure on $SL(n, \mathbb{R})$, such
that $\log \|X\| \in L^1(\mu)$, and $n \ge 3$. Then

\[ \int_{SL(n,\mathbb{R})} \log \rho(X)\,d\mu \ge C_n \int_{SL(n,\mathbb{R})} \log \|X\|\,d\mu. \]

The same statement holds with $\mathbb{C}$ replacing $\mathbb{R}$ and "unitarily invariant"
replacing "orthogonally invariant".

Proof. Integrate both sides of the inequality in Theorem 4. □

The plan of the paper is as follows: In Section 1 we summarize
some necessary facts of linear algebra. In Section 2 we give a proof
of Theorem 3. In Section 3 we outline our approach to the proof of
Theorem 4. The actual proof falls naturally into two parts: one part
covers the case where $A^t A$ is far from $I_n$ and is the content of Section 4,
and the other part covers the case where it is close to the identity, and is the
content of Section 5. This last section requires some simple results
on perturbation theory, and these are covered in Section 7.

It should be remarked that no effort is made to estimate the constant
$C_n$ which appears in the statement of Theorem 4. In order to have
a sharp estimate, the methods of the current paper will need to be
combined with detailed analysis of the structure of the orthogonal
groups, and this will be the subject of another paper.

1. Generalities on matrices 

First, 

Definition 6. Let $A \in M^{m \times n}$. Then the nonnegative square roots of the
eigenvalues of the (positive semi-definite) matrix $A^t A$ are called the singular
values of $A$. We will denote individual singular values by $\sigma_1 \ge \dots \ge \sigma_m$,
and we shall denote the ordered $m$-tuple $(\sigma_1, \dots, \sigma_m)$ by $\Sigma(A)$.

In this note we will only work with square matrices (although the 
results go through without change for non-square matrices), and our 
use of singular values will be localized to the following two lemmas: 

Lemma 7. Let $A \in M^{n \times n}$, and let $P \in O(n)$. Then

\[ \Sigma(A) = \Sigma(AP) = \Sigma(PA). \]

Proof. Immediate. □

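Lemma 8. Let $A \in M^{n \times n}$, and let $f : \mathbb{R} \to \mathbb{R}$ be such that

\[ \mathfrak{I} = \int_{S^{n-1}} f(\langle Au, Au \rangle)\,d\sigma_n < \infty. \]

Then

\[ \mathfrak{I} = \int_{S^{n-1}} f\Big(\sum_{i=1}^n \sigma_i^2(A)\,u_i^2\Big)\,d\sigma_n. \]

Proof. It is sufficient to note that

\[ \langle Au, Au \rangle = u^t A^t A u = (Pu)^t\,\mathrm{Diag}(\Sigma^2(A))\,Pu, \]

where $P$ is an orthogonal matrix (hence an isometry of $S^{n-1}$). □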
Corollary 9.

\[ \int_{\mathfrak{A}} do \int_{S^{n-1}} \log \|Bu\|\,d\sigma_n = \frac{1}{2} \int_{S^{n-1}} \log\Big(\sum_{i=1}^n \sigma_i^2(A)\,u_i^2\Big)\,d\sigma_n. \]

Proof. Lemma 7 tells us that the inner integral does not depend on
$B \in \mathfrak{A}$, and Lemma 8 tells us how to evaluate that integral in terms
of singular values of $A$. □

Corollary 9 tells us that Question 1 will be answered once we
succeed in showing Theorem 3 (for diagonal $A$).

Theorem-Definition 10 (Gershgorin's Theorem). Let $A \in M_n(\mathbb{C})$. Let

(1) $r_i = \sum_{j \ne i} |a_{ij}|$,

(2) $s_i = \sum_{j \ne i} |a_{ji}|$,

and let the row- and column-Gershgorin disks (respectively) be:

(3) $\mathcal{R}_i = \{z \in \mathbb{C} \mid |z - a_{ii}| \le r_i\}$,

(4) $\mathcal{S}_i = \{z \in \mathbb{C} \mid |z - a_{ii}| \le s_i\}$.

Let now $\lambda$ be an eigenvalue of $A$. Then there exist a $k$ and an $l$ such that
$\lambda \in \mathcal{R}_k$ and $\lambda \in \mathcal{S}_l$.

Proof. See, eg, Q or |7|. □ 
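The statement of Gershgorin's Theorem can be checked numerically on a small example. The following is a minimal sketch, assuming a 2 x 2 complex matrix so that the eigenvalues come from the quadratic formula; the helper names (`gershgorin_disks`, `eigenvalues_2x2`, `in_some_disk`) and the sample matrix are illustrative and do not come from the paper.

```python
import cmath

def gershgorin_disks(a):
    """Return (row_disks, column_disks) as lists of (center, radius)."""
    n = len(a)
    rows = [(a[i][i], sum(abs(a[i][j]) for j in range(n) if j != i))
            for i in range(n)]
    cols = [(a[i][i], sum(abs(a[j][i]) for j in range(n) if j != i))
            for i in range(n)]
    return rows, cols

def eigenvalues_2x2(a):
    """Eigenvalues of a 2 x 2 matrix via the characteristic polynomial."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return [(tr + root) / 2, (tr - root) / 2]

def in_some_disk(z, disks, tol=1e-12):
    """True if z lies in at least one of the closed disks."""
    return any(abs(z - center) <= radius + tol for center, radius in disks)

# Hypothetical sample matrix, chosen only for illustration.
SAMPLE = [[4, 1], [2, 1 + 1j]]
ROW_DISKS, COL_DISKS = gershgorin_disks(SAMPLE)
for lam in eigenvalues_2x2(SAMPLE):
    print(lam, in_some_disk(lam, ROW_DISKS), in_some_disk(lam, COL_DISKS))
```

Each eigenvalue should be reported as lying in some row disk and in some column disk, which is what the theorem asserts.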

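2. Proof of Theorem 3

To prove Theorem 3, by Lemma 8 it will be sufficient to prove

Theorem 11.

\[ \mathfrak{I}(a_1, \dots, a_n) = \int_{S^{n-1}} \log\Big(\sum_{i=1}^n a_i x_i^2\Big)\,d\sigma_n \ge 0, \]

whenever $a_1, \dots, a_n > 0$, and $\prod_{i=1}^n a_i = 1$. Furthermore, equality holds only
when $a_1 = \dots = a_n = 1$.

Proof. By the (weighted version of) the arithmetic-geometric mean
inequality (see [6]),

\[ \sum_{i=1}^n x_i^2 a_i \ge \prod_{i=1}^n a_i^{x_i^2}, \]

with equality if and only if all of the $a_i$ are equal (the above uses the
fact that $\sum_{i=1}^n x_i^2 = 1$). Taking logs, we see that

\[ \log\Big(\sum_{i=1}^n x_i^2 a_i\Big) \ge \sum_{i=1}^n x_i^2 \log a_i, \]

with equality if and only if all the $a_i$ are equal. And integrating, we
see that

\[ \mathfrak{I}(a_1, \dots, a_n) \ge \int_{S^{n-1}} \sum_{i=1}^n x_i^2 \log(a_i)\,d\sigma_n = \sum_{i=1}^n \log(a_i) \int_{S^{n-1}} x_i^2\,d\sigma_n = \log\Big(\prod_{i=1}^n a_i\Big) \int_{S^{n-1}} x_1^2\,d\sigma_n = 0, \]

where the next-to-last step uses the symmetry of the sphere (the integral of $x_i^2$ does not depend on $i$),
with equality if and only if $a_i = 1$ for all $i$.

□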
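The inequality of Theorem 11 can be sanity-checked numerically in the simplest case $n = 2$. The sketch below averages $\log(a_1 x_1^2 + a_2 x_2^2)$ over the unit circle with a midpoint rule, assuming $a_2 = 1/a_1$ and a normalized (probability) measure; the function names and the sample count are illustrative. The comparison value uses the standard integral $\frac{1}{2\pi}\int_0^{2\pi} \log(a^2\cos^2\theta + b^2\sin^2\theta)\,d\theta = 2\log\frac{a+b}{2}$, which is not taken from the paper.

```python
import math

def mean_log_form(a1, samples=4096):
    """Average of log(a1*x1^2 + a2*x2^2) over the unit circle, a2 = 1/a1."""
    a2 = 1.0 / a1
    total = 0.0
    for k in range(samples):
        theta = 2.0 * math.pi * (k + 0.5) / samples  # midpoint rule
        x1, x2 = math.cos(theta), math.sin(theta)
        total += math.log(a1 * x1 * x1 + a2 * x2 * x2)
    return total / samples

def closed_form(a1):
    """Closed form of the same average: 2*log((sqrt(a1) + 1/sqrt(a1))/2)."""
    root = math.sqrt(a1)
    return 2.0 * math.log((root + 1.0 / root) / 2.0)

for a1 in (1.0, 2.0, 4.0, 25.0):
    print(a1, mean_log_form(a1), closed_form(a1))
```

The average is zero at $a_1 = a_2 = 1$ and positive otherwise, in line with the equality case of the theorem.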
3. A lower bound on average spectral radius

In this section we consider the following average:

\[ \mathfrak{R}(A) = \int_{O_n} \log \rho(AX)\,dX, \]

where $\rho(M)$ denotes the spectral radius of the matrix $M$. The main
theorem of the section will be the following estimate:

Theorem 12. Let $A \in SL(n, \mathbb{R})$, let $n \ge 3$, and let $\sigma_1(A)$ be the largest
singular value of $A$. Then there exists a constant $C_n > 0$, such that

(5) $\mathfrak{R}(A) \ge C_n \log \sigma_1(A)$.

Remark 13. Theorem 12 is false for $n = 2$ and trivially true for $n = 1$
(d = 1).

To prove Theorem 12 we first make a couple of observations. First,

Lemma 14. Let $A \in M_n$, let $\Sigma(A)$ be the singular values of $A$, and let $\rho(X)$
be the spectral radius of the matrix $X$. Then

\[ \int_{O_n} f(\rho(AY))\,dY = \int_{O_n} f(\rho(\mathrm{Diag}(\Sigma(A))\,Y))\,dY. \]

Proof. Form the singular value decomposition of $A$, that is, write
$A = P_1\,\mathrm{Diag}(\Sigma(A))\,P_2$, where $P_1, P_2 \in O_n$. Then,

\[ \rho(AY) = \rho(\mathrm{Diag}(\Sigma(A))\,P_2 Y P_1). \]

The statement of the Lemma follows immediately, since the Haar measure is invariant under $Y \mapsto P_2 Y P_1$. □

Lemma 14 tells us that in the statement of Theorem 12 it is enough
to consider diagonal matrices $A$ with positive entries. In the rest of this
section we will labor under these assumptions. The strategy will be
to first demonstrate a bound in any closed set of diagonal matrices
not containing the identity matrix $I_n$ (this will be done in Section
4, particularly Corollary 17), and then show a bound in some small
neighborhood of the identity (this will be the content of Section 5).
Our measure of the distance from the identity will be the quantity

\[ L(A) = \max_{i=1}^{n} |\log \sigma_i(A)|. \]

the top singular value $\sigma_1(A)$. From now on, $A = \mathrm{Diag}(d_1, \dots, d_n)$, with
$d_1 \ge d_2 \ge \dots \ge d_n > 0$. We will assume that $\prod_{i=1}^n d_i = 1$ when necessary;
much of the time we will only use the fact that $d_1 \ge 1$.
4. Away from identity

Theorem 15. Let $A = \mathrm{Diag}(d_1, \dots, d_n)$, with $d_1 \ge d_2 \ge \dots \ge d_n > 0$, and
let $\|M\|_1 \le 1$. Let $A_t = A(I + tM)$. Then,

\[ d_1(1 + 2t) \ge \rho(A_t) \ge d_1(1 - 2nt). \]

Proof. Since $(A_t)_{ij} = d_i(\delta_{ij} + tM_{ij})$, it follows that

(6) $d_i(1 + t) \ge |(A_t)_{ii}| \ge d_i(1 - t)$,

while

(7) $|r_i(A_t)| \le d_i t$.

Let

\[ \mathcal{G}(A_t) = \bigcup_{i=1}^{n} \mathcal{R}_i(A_t). \]

By a simple continuity argument (see, for example, [2] for a similar
argument), the perturbation of the eigenvalue $d_1$ of $A$ stays in the
connected component of the Gershgorin disk $\mathcal{R}_1(A_t)$. Since by the
above inequalities Eq. (6) and Eq. (7),

\[ \min_{z \in \mathcal{R}_1(A_t)} |z| \ge d_1(1 - 2t), \qquad \max_{z \in \mathcal{R}_1(A_t)} |z| \le d_1(1 + 2t), \]

we see that after time $t$ the connected component of $\mathcal{R}_1(A_t)$ can only
stay between the advertised bounds. □

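Theorem 16. Let $A = \mathrm{Diag}(d_1, \dots, d_n)$. For any $\epsilon > 0$, there is a constant
$c_\epsilon > 0$, such that

\[ \mathfrak{R}(A) \ge c_\epsilon (\log d_1 - \epsilon). \]

Proof. First, let $t_0$ be so small that $\log(1 - 2nt_0) > -\epsilon$. Now let

\[ S_0 = \{X \in O_n \mid \|X - I\|_1 < t_0\}, \]

and let

\[ c_\epsilon = \frac{\mathrm{Vol}(S_0)}{\mathrm{Vol}(O_n)}. \]

We see that

\[ \mathfrak{R}(A) \ge \frac{\int_{S_0} \log \rho(AX)\,dX}{\mathrm{Vol}\,O_n} \ge c_\epsilon (\log d_1 - \epsilon). \]

□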
Corollary 17. Let $D$ be the set of all diagonal matrices with elements
$d_1 \ge d_2 \ge \dots \ge d_n > 0$, and let $D_t$ be the set of those matrices $M \in D$ with
$d_1(M) \ge t > 1$. Then there is a constant $C_t > 0$ such that

\[ \frac{\mathfrak{R}(A)}{\log(d_1(A))} \ge C_t, \]

for all $A \in D_t$.

Proof. Let $s = \log(t)/2$. Then, by Theorem 16,

\[ \mathfrak{R}(A) \ge c_s (\log(d_1(A)) - s) \ge \frac{c_s \log d_1(A)}{2}, \]

where the last inequality holds for all $A \in D_t$. Now set $C_t = c_s/2$. □

5. Close to the Identity 

In this section we shall prove the following: 

Theorem 18. Let $n \ge 3$. Let $A = \mathrm{Diag}(\exp(d_1 t), \dots, \exp(d_n t))$, with
$d_1 \ge \dots \ge d_n$, with $\max(|d_1|, |d_n|) = 1$, and

\[ \sum_{i=1}^{n} d_i = 0. \]

Then there exist constants $C_n$ and $\epsilon_n$, such that

(8) $\mathfrak{R}(A) \ge C_n t d_1$

for $0 < t < \epsilon_n$.

Remark 19. The argument which follows can be made to give quite 
explicit lower bounds for both $\epsilon_n$ and $C_n$. However, this does not
seem useful, since it is quite clear that the bounds will not be close to 
the truth. It seems plausible that a combination of the methods of this 
note with a detailed understanding of the structure of the orthogonal 
group will give a tight result, but we shall not attempt to do this here. 
It should, on the other hand, be noted that not using any structural 
property of the orthogonal group means that the arguments in this 
note work just as well for cosets of unitary groups. 

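To prove Theorem 18 we shall first need the following trivial but
crucial observation:

Observation 20. Let $A \in SL(n, \mathbb{R})$, and let $X \in O_n(\mathbb{R})$. Then $\rho(AX) \ge 1$.
Proof. $|\det AX| = 1$, so the eigenvalues of $AX$ cannot all have absolute value less than 1. □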
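Observation 20 is easy to test numerically in dimension 2, where the spectral radius has a closed form. The sketch below sweeps over rotations and reflections in $O(2)$ and records the smallest spectral radius of $AX$; the sample matrix `SAMPLE_A` and all function names are assumptions made for illustration only.

```python
import cmath
import math

def spectral_radius_2x2(m):
    """Largest eigenvalue modulus of a real 2 x 2 matrix."""
    (p, q), (r, s) = m
    tr = p + s
    det = p * s - q * r
    root = cmath.sqrt(tr * tr - 4 * det)
    return max(abs((tr + root) / 2), abs((tr - root) / 2))

def matmul_2x2(a, b):
    """Product of two 2 x 2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def orthogonal_2x2(theta, reflect=False):
    """Rotation by theta, or a reflection when reflect is True."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, s), (s, -c)) if reflect else ((c, -s), (s, c))

def min_spectral_radius(a, steps=360):
    """Smallest spectral radius of a*X over a grid of X in O(2)."""
    return min(
        spectral_radius_2x2(
            matmul_2x2(a, orthogonal_2x2(2 * math.pi * k / steps, reflect))
        )
        for k in range(steps)
        for reflect in (False, True)
    )

# Hypothetical element of SL(2, R): determinant 2 * 0.5 - 3 * 0 = 1.
SAMPLE_A = ((2.0, 3.0), (0.0, 0.5))
print(min_spectral_radius(SAMPLE_A))
```

Up to rounding, the printed minimum should not fall below 1, since the two eigenvalue moduli multiply to $|\det AX| = 1$.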
Now, recall that

\[ \mathfrak{R}(A) = \int_{O_n} \log \rho(AX)\,dX. \]

By the observation above, it follows that the integrand is everywhere
nonnegative, and so to prove a lower bound such as that of Theorem
18 it will be enough to show the inequality (8) with $\mathfrak{R}(A)$ replaced by

\[ \mathfrak{R}_S(A) = \int_{S \subseteq O_n} \log \rho(AX)\,dX. \]

Let us pick $S$ to be a small neighborhood around a matrix $X_0 \in O_n$.

Notice that all eigenvalues of $X_0 \in O_n$ have absolute value 1. We
will choose $X_0$ to have a simple eigenvalue $\lambda_0$, such that

(9) $\dfrac{d \log |\lambda_0(\mathrm{Diag}(\exp(t d_1), \dots, \exp(t d_n)) X_0)|}{dt} \ge C |d_1|$,

for some constant $C$. By analyticity of $\lambda_0$ (Theorem 23), inequality (9)
suffices to show that

(10) $\mathfrak{R}_S(A) \ge C_n t d_1$

for $t$ sufficiently small, and by Observation 20 this will suffice to
show Theorem 18.



5.1. Proof of inequality (9). First, we need to find a suitable orthogonal
matrix $X_0$. There will be two cases, depending on the parity of
$n$.

The first case is that of odd $n$. In this case, we pick $X_0$ to be a rotation
by 180 degrees around the axis $e_1 = (1, 0, \dots, 0)$, and our $\lambda_0(X_0) = 1$.
Its eigenvector is $v_{\lambda_0} = e_1$. By Lemma 25,

\[ \frac{d\lambda_0}{dx} = \lambda_0(x)\,d_1, \]

and we are done.

The second case is that of $n$ even. In that case, we pick $X_0$ to fix the
span of $e_3, \dots, e_n$ and to rotate the span of $e_1$ and $e_2$ by 90 degrees. In
this case, $\lambda_0(X_0) = i$, and the eigenvector of $\lambda_0$ is

\[ v_{\lambda_0} = \frac{1}{\sqrt{2}} (1, i, 0, \dots, 0). \]

By Lemma 25,

\[ \frac{d\lambda_0}{dx} = \lambda_0(x)\,\frac{d_1 + d_2}{2}. \]

If $d_2 \ge 0$, we are done, since $d_1 + d_2 \ge d_1$. Otherwise, since

\[ \sum_{i=1}^{n} d_i = 0, \]

it follows that

\[ d_1 = -\sum_{i=2}^{n} d_i = \sum_{i=2}^{n} |d_i|. \]

Since $|d_n| \ge |d_{n-1}| \ge \dots \ge |d_2|$, it follows that $d_1 \ge (n - 1)|d_2|$, and so

\[ d_1 + d_2 \ge \frac{n-2}{n-1}\,d_1, \]

so as long as $n > 2$ we are done. Notice that the argument breaks
down when $n = 2$, and in fact, in that case the result is false.

6. What about dimension 2?

The function $f(d_1, \dots, d_n) = \mathfrak{R}(\mathrm{Diag}(\exp(d_1), \dots, \exp(d_n)))$ restricted
to the set

\[ \sum_{i=1}^{n} d_i = 0 \]

always has a minimum at the origin. In dimension 2, however,
the minimum is an actual critical point. This is not hard to see
using Lemma 25 together with the observation that the group $O(2)$ is
abelian, and thus every $X \in O(2)$ has $(1, i)$ and $(i, 1)$ for an orthonormal
basis of eigenvectors. Thus, at $d_1 = 0$,

\[ \frac{d \log(\rho(\mathrm{Diag}(\exp(d_1), \exp(-d_1)) X))}{d d_1} = 0 \]

for every $X \in O(2)$. In dimension $n > 2$ the function $\mathfrak{R}$ is not smooth
at the origin.

As a matter of fact, in dimension 2 many of the computations can
be made much more explicit. Indeed, a general element $R_\theta$ of $SO(2)$
has the form:

\[ R_\theta = \begin{pmatrix} \cos(\theta) & \sin(\theta) \\ -\sin(\theta) & \cos(\theta) \end{pmatrix}, \]

and so if $A = \mathrm{Diag}(a, 1/a)$, then, denoting $t = (a + 1/a)/2$,

\[ \operatorname{tr} AR_\theta = 2t\cos(\theta), \]

and so

\[ \rho(AR_\theta) = \max\left( \left| t\cos(\theta) + \sqrt{t^2\cos^2(\theta) - 1} \right|, \left| t\cos(\theta) - \sqrt{t^2\cos^2(\theta) - 1} \right| \right). \]

A simple computation shows:

Lemma 21. When $|\cos(\theta)| < 1/t$, then $\rho(AR_\theta) = 1$ (in other words, the
matrix $AR_\theta$ is elliptic).
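Lemma 21 can be illustrated with a short computation. The following is a minimal sketch that builds $AR_\theta$ for $A = \mathrm{Diag}(a, 1/a)$ and evaluates its spectral radius from the trace and determinant; the function names and the sample value $a = 2$ (so $t = 1.25$ and $1/t = 0.8$) are chosen here for illustration.

```python
import cmath
import math

def a_times_rotation(a, theta):
    """The matrix Diag(a, 1/a) * R_theta with R_theta as in the text."""
    c, s = math.cos(theta), math.sin(theta)
    return ((a * c, a * s), (-s / a, c / a))

def spectral_radius_2x2(m):
    """Largest eigenvalue modulus of a real 2 x 2 matrix."""
    (p, q), (r, s) = m
    tr = p + s
    det = p * s - q * r
    root = cmath.sqrt(tr * tr - 4 * det)
    return max(abs((tr + root) / 2), abs((tr - root) / 2))

def is_elliptic_range(a, theta):
    """True when |cos(theta)| < 1/t, with t = (a + 1/a)/2."""
    t = (a + 1.0 / a) / 2.0
    return abs(math.cos(theta)) < 1.0 / t

SAMPLE_SCALE = 2.0  # hypothetical value of a
for theta in (0.0, 0.5, 1.0, 1.5, 2.0):
    rho = spectral_radius_2x2(a_times_rotation(SAMPLE_SCALE, theta))
    print(theta, is_elliptic_range(SAMPLE_SCALE, theta), rho)
```

For angles in the elliptic range the spectral radius should equal 1 up to rounding, and it exceeds 1 outside that range.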

We have the following corollary (which answers a question of [5]):

Corollary 22. Let $\mu$ be a measure on $SO(2)$ such that the support of $\mu$ omits
some interval. Then, there exists a matrix $A \in SL(2, \mathbb{R})$, such that

\[ \int_{SO(2)} \log \rho(AR_\theta)\,d\mu = 0. \]

Proof. Suppose that the support of $\mu$ omits $[a, b]$. Let $c = (a + b)/2$
($a, b, c$ are all modulo $2\pi$). Let $s = |(a - b)/2|$, let $\alpha = 1/\cos(s)$, and
let $\beta$ be positive, and such that $\beta + 1/\beta < 2\alpha$ (in other words, $\beta <
\exp(\operatorname{arccosh}(\alpha))$). Let $A = \mathrm{Diag}(\beta, 1/\beta)R_{-c}$. Then Lemma 21 shows
that $A$ is the matrix whose existence is asserted in the corollary. □

7. Some perturbation theory 

In this section we will recall some formulas of perturbation theory 
(an exhaustive treatment can be found in the classic [8]), and derive
some estimates needed in the rest of this paper. 

Theorem 23. Let $T$ be an $n \times n$ matrix (thought of as a point in $\mathbb{C}^{n^2}$), and
let $\lambda$ be a simple eigenvalue of $T$. Then there is a neighborhood of $T$ where
$\lambda$ is a holomorphic function of $T$. In particular, $\lambda$ is $C^\infty$ in a neighborhood
of $T$.

Proof. This follows from generalities on analyticity of (simple) roots
of polynomials as functions of their coefficients. See [8] for some
further details. □

Theorem 24. Let $\lambda$ be a simple eigenvalue of $T(0)$. Then

\[ \frac{d\lambda}{dx} = \operatorname{tr}\left( \frac{dT(x)}{dx}\,P_\lambda \right), \]

where $P_\lambda$ is the orthogonal projection onto the eigenspace of $\lambda$.

Proof. See [8, p. 79]. □

Lemma 25. Let $T(x) = \mathrm{Diag}(\exp(d_1 x), \dots, \exp(d_n x))T$, and let $\lambda(x)$ be
a simple eigenvalue of $T(x)$. Let the unit eigenvector of $\lambda$ be $v_\lambda = (v_1, \dots, v_n)$.
Then,

\[ \frac{d\lambda}{dx} = \lambda(x) \sum_{i=1}^{n} d_i |v_i|^2. \]

In particular, the logarithmic derivative of $\lambda(x)$ is real.

Proof. Use Theorem 24 and the observation that

\[ T(x) P_{\lambda(x)} = \lambda(x) P_{\lambda(x)}, \]

to write

\[ \frac{1}{\lambda(x)} \frac{d\lambda}{dx} = \operatorname{tr}\left( \mathrm{Diag}(d_1, \dots, d_n)\,P_{\lambda(x)} \right). \]

Now, note that $(P_{\lambda(x)})_{ij} = v_i \bar{v}_j$. □